AITopics

Neural Information Processing Systems, Feb-10-2026, 02:59:42 GMT

Finite-Time Analysis of Adaptive Temporal Difference Learning with Deep Neural Networks

Nevertheless, the performance guarantee of adaptive TD with neural network approximation remains widely unknown.

approximation, machine learning, reinforcement learning, (16 more...)

Country:

Asia > China (0.05)
North America > United States (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.97)

Neural Information Processing Systems, Dec-24-2025, 13:47:07 GMT

Finite-Time Analysis of Adaptive Temporal Difference Learning with Deep Neural Networks

Temporal difference (TD) learning with function approximations (linear functions or neural networks) has achieved remarkable empirical success, giving impetus to the development of finite-time analysis. As an accelerated version of TD, the adaptive TD has been proposed and proved to enjoy finite-time convergence under the linear function approximation. Existing numerical results have demonstrated the superiority of adaptive algorithms to vanilla ones. Nevertheless, the performance guarantee of adaptive TD with neural network approximation remains widely unknown. This paper establishes the finite-time analysis for the adaptive TD with multi-layer ReLU network approximation whose samples are generated from a Markov decision process. Our established theory shows that if the width of the deep neural network is large enough, the adaptive TD using neural network approximation can find the (optimal) value function with high probabilities under the same iteration complexity as TD in general cases. Furthermore, we show that the adaptive TD using neural network approximation, with the same width and searching area, can achieve theoretical acceleration when the stochastic semi-gradients decay fast.
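The abstract above does not spell out the update rule, so the following is only a minimal sketch of what an adaptive TD(0) step with a ReLU network might look like. It assumes a two-layer network with fixed output signs, an AdaGrad-style per-coordinate step size, and a small Markov reward process given by a transition matrix `P` and reward vector `r`. All function names, the network shape, and the step-size rule are illustrative assumptions, not the paper's algorithm.

```python
import numpy as np

def init_params(state_dim, width, rng):
    """Random hidden weights W and fixed +/-1 output signs a (assumed setup)."""
    W = rng.normal(size=(width, state_dim))
    a = rng.choice([-1.0, 1.0], size=width)
    return W, a

def value_and_grad(W, a, x):
    """Two-layer ReLU value estimate V(x) and its gradient with respect to W."""
    pre = W @ x
    v = a @ np.maximum(pre, 0.0) / np.sqrt(len(a))
    # dV/dW[j, :] = a[j] * 1[pre[j] > 0] * x / sqrt(width)
    grad = ((a * (pre > 0.0))[:, None] * x[None, :]) / np.sqrt(len(a))
    return v, grad

def adaptive_td(P, r, features, gamma=0.9, width=64, steps=5000,
                eta=0.5, eps=1e-8, seed=0):
    """TD(0) along one Markov trajectory with an AdaGrad-style step (assumed)."""
    rng = np.random.default_rng(seed)
    n_states = P.shape[0]
    W, a = init_params(features.shape[1], width, rng)
    accum = np.zeros_like(W)  # running sum of squared semi-gradients
    s = 0
    for _ in range(steps):
        s_next = rng.choice(n_states, p=P[s])
        v, grad = value_and_grad(W, a, features[s])
        v_next, _ = value_and_grad(W, a, features[s_next])
        delta = r[s] + gamma * v_next - v  # TD error
        g = -delta * grad  # semi-gradient: the bootstrapped target is held fixed
        accum += g * g
        W -= eta * g / (np.sqrt(accum) + eps)
        s = s_next
    return W, a
```

Because each coordinate's step is divided by the accumulated semi-gradient magnitude, coordinates whose semi-gradients decay quickly keep comparatively large effective step sizes, which is the intuition behind the acceleration claim in the abstract.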

adaptive td, adaptive temporal difference learning, approximation, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

arXiv.org Artificial Intelligence, Nov-4-2025

Sharp Lower Bounds for Linearized ReLU^k Approximation on the Sphere

Mao, Tong, Xu, Jinchao

We prove a saturation theorem for linearized shallow ReLU$^k$ neural networks on the unit sphere $\mathbb S^d$. For any antipodally quasi-uniform set of centers, if the target function has smoothness $r>\tfrac{d+2k+1}{2}$, then the best $\mathcal{L}^2(\mathbb S^d)$ approximation cannot converge faster than order $n^{-\frac{d+2k+1}{2d}}$. This lower bound matches existing upper bounds, thereby establishing the exact saturation order $\tfrac{d+2k+1}{2d}$ for such networks. Our results place linearized neural-network approximation firmly within the classical saturation framework and show that, although ReLU$^k$ networks outperform finite elements under equal degrees $k$, this advantage is intrinsically limited.
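As a quick worked check of the two quantities in this abstract, the sketch below evaluates the saturation order $\tfrac{d+2k+1}{2d}$ and the smoothness threshold $\tfrac{d+2k+1}{2}$ as exact fractions. The function names are illustrative and not taken from the paper.

```python
from fractions import Fraction

def saturation_order(d, k):
    """Exponent s in the rate n^(-s), with s = (d + 2k + 1) / (2d)."""
    return Fraction(d + 2 * k + 1, 2 * d)

def smoothness_threshold(d, k):
    """The lower bound is stated for smoothness r > (d + 2k + 1) / 2."""
    return Fraction(d + 2 * k + 1, 2)

# Example: on the sphere S^2 (d = 2) with ReLU (k = 1),
# the order is 5/4 and the threshold is 5/2.
order = saturation_order(2, 1)
threshold = smoothness_threshold(2, 1)
```

For fixed $d$ the order grows linearly in $k$, and for fixed $k$ it decreases toward $\tfrac{1}{2}$ as $d$ increases.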

artificial intelligence, machine learning, neural network, (15 more...)

2510.0406

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing Systems, Aug-16-2025, 07:06:44 GMT

Finite-Time Analysis of Adaptive Temporal Difference Learning with Deep Neural Networks

From a theoretical perspective, however, establishing convergence guarantees for training DNNs is much more complicated than for linear approximation algorithms, and the problem remains largely open.

approximation, machine learning, reinforcement learning, (16 more...)

Country:

Asia > China (0.05)
North America > United States > Utah (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(3 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.65)

Hur, Youngmi, Lim, Hyojae, Lim, Mikyoung

Provable wavelet-based neural approximation

arXiv.org Machine Learning, Apr-23-2025

Provable wavelet-based neural approximation. Youngmi Hur, Hyojae Lim, Mikyoung Lim. April 24, 2025.

Abstract. In this paper, we develop a wavelet-based theoretical framework for analyzing the universal approximation capabilities of neural networks over a wide range of activation functions. Leveraging wavelet frame theory on the spaces of homogeneous type, we derive sufficient conditions on activation functions to ensure that the associated neural network approximates any function in the given space, along with an error estimate. These sufficient conditions accommodate a variety of smooth activation functions, including those that exhibit oscillatory behavior. Furthermore, by considering the $L^2$-distance between smooth and non-smooth activation functions, we establish a generalized approximation result that is applicable to non-smooth activations, with the error explicitly controlled by this distance. This provides increased flexibility in the design of network architectures.

1 Introduction. Neural networks have long been recognized for their remarkable ability to approximate a wide range of functions, enabling state-of-the-art achievements across various fields in machine learning and artificial intelligence, image processing, natural language processing, and scientific computing (see, for example, [13, 19] and references therein). Various activation functions, such as ReLU, Sigmoid, Tanh, and oscillatory functions, have also been explored to further enhance network performance and adaptability. The versatility of neural networks originates from the structural flexibility of architectures that combine affine transformations with nonlinear activation functions. In addition, classical universal approximation theorems [5, 12, 16] provide a theoretical basis for this flexibility by guaranteeing that, under suitable conditions, neural networks can approximate any continuous function on a bounded domain, underscoring their representational power.

These seminal results have been extended along various directions, including radial basis function (RBF) networks [22, 25], non-polynomial activations [20], approximation of functions and their derivatives [15, 21], the influence of network depth [9], approximation error bounds [1], convolutional neural networks (CNN) [32], and recurrent neural networks (RNN) [27]. As neural network architectures continue to evolve and diversify in practice, their theoretical foundations (beyond those provided by classical approximation theorems) have attracted

Department of Mathematics, Yonsei University, Seoul 03722, Republic of Korea (yhur@yonsei.ac.kr)

artificial intelligence, deep learning, machine learning, (16 more...)

arXiv.org Machine Learning

2504.16682

Country:

Asia > South Korea > Seoul > Seoul (0.24)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Ohio > Portage County > Kent (0.04)
(2 more...)

Genre: Research Report (0.84)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)

Neural Information Processing Systems, Jan-14-2025, 22:23:01 GMT

Finite-Time Analysis of Adaptive Temporal Difference Learning with Deep Neural Networks

adaptive td, adaptive temporal difference learning, approximation, (5 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.45)

Petersen, Philipp, Zech, Jakob

Mathematical theory of deep learning

arXiv.org Artificial Intelligence, Jul-25-2024

It is designed to help students and researchers to quickly familiarize themselves with the area and to provide a foundation for the development of university courses on the mathematics of deep learning. Our main goal in the composition of this book was to present various rigorous, but easy to grasp, results that help to build an understanding of fundamental mathematical concepts in deep learning. To achieve this, we prioritize simplicity over generality. As a mathematical introduction to deep learning, this book does not aim to give an exhaustive survey of the entire (and rapidly growing) field, and some important research directions are missing. In particular, we have favored mathematical results over empirical research, even though an accurate account of the theory of deep learning requires both.

approximation error, neural network approximation, universal approximation theorem, (16 more...)

2407.18384

Country:

North America > Canada > Ontario > Toronto (0.13)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.13)
North America > United States > New York > New York County > New York City (0.04)
(24 more...)

Genre:

Summary/Review (1.00)
Research Report (1.00)
Overview (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Setting (0.65)
Leisure & Entertainment > Games (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.45)

Schotthöfer, Steffen, Laiu, M. Paul, Frank, Martin, Hauck, Cory D.

Structure-preserving neural networks for the regularized entropy-based closure of the Boltzmann moment system

arXiv.org Artificial Intelligence, Jun-1-2024

The main challenge of large-scale numerical simulation of radiation transport is the high memory and computation time requirements of discretization methods for kinetic equations. In this work, we derive and investigate a neural network-based approximation to the entropy closure method to accurately compute the solution of the multi-dimensional moment system with a low memory footprint and competitive computational time. We extend methods developed for the standard entropy-based closure to the context of regularized entropy-based closures. The main idea is to interpret structure-preserving neural network approximations of the regularized entropy closure as a two-stage approximation to the original entropy closure. We conduct a numerical analysis of this approximation and investigate optimal parameter choices. Our numerical experiments demonstrate that the method has a much lower memory footprint than traditional methods with competitive computation times and simulation accuracy.

closure, entropy closure, neural network, (16 more...)

2404.14312

Country:

North America > United States > Tennessee > Anderson County > Oak Ridge (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.04)

Genre: Research Report (0.50)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Energy (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Jang, Deok-Kyu, Kim, Hyea Hyun, Kim, Kyungsoo

Enhanced physics-informed neural networks with domain scaling and residual correction methods for multi-frequency elliptic problems

arXiv.org Artificial Intelligence, Nov-7-2023

A physics-informed neural network (PINN) combines the constraint-satisfaction ability of partial differential equations (PDEs) with the representation power of deep neural networks to learn solutions of PDEs. PINNs were first introduced in [3, 7, 11] as a way of solving problems in mathematical physics and engineering that can be modeled as PDEs. The idea behind PINNs is to treat the solution of a PDE as an unknown function that can be represented by a neural network. The neural network is then trained end-to-end to satisfy the boundary conditions and PDE constraints. This enables PINNs to deal with problems that are challenging to solve using conventional numerical techniques, such as those with high-dimensional input spaces and complex boundary conditions. Due to the growing need for effective solutions to challenging physical problems in fields like fluid dynamics, structural mechanics, and heat transfer, PINNs have become increasingly popular in recent years. Computational and theoretical studies have also shown PINNs to be useful for problems in machine learning, computer vision, and other fields outside physics and engineering, owing to their flexibility and representational power. PINNs have been applied to a variety of problems in physics, engineering, and other fields, including solving PDEs, modeling physical systems, and carrying out data-driven simulations. However, some obstacles still arise when applying them in computational science and engineering.
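To make the "PDE constraints plus boundary conditions" objective concrete, here is a minimal sketch of a PINN loss for an assumed model problem: the 1D Poisson equation $-u''(x) = f(x)$ on $(0, 1)$ with $u(0) = u(1) = 0$. The one-hidden-layer tanh network, the closed-form second derivative, and all names are illustrative assumptions. The sketch does not implement the domain scaling or residual correction methods named in the paper's title, and it omits the optimizer.

```python
import numpy as np

def network(params, x):
    """One-hidden-layer tanh network u(x) and its exact second derivative."""
    w, b, c = params
    z = np.outer(x, w) + b  # shape (n_points, width)
    t = np.tanh(z)
    u = t @ c
    # d^2/dx^2 tanh(w x + b) = -2 w^2 tanh(z) (1 - tanh(z)^2)
    u_xx = (-2.0 * t * (1.0 - t**2) * w**2) @ c
    return u, u_xx

def pinn_loss(params, x_interior, f, boundary_weight=1.0):
    """Mean squared PDE residual at collocation points plus a boundary penalty."""
    u, u_xx = network(params, x_interior)
    residual = -u_xx - f(x_interior)  # zero when -u'' = f holds exactly
    u_boundary, _ = network(params, np.array([0.0, 1.0]))
    return np.mean(residual**2) + boundary_weight * np.mean(u_boundary**2)

# Usage: evaluate the loss for arbitrary (untrained) parameters.
params = (np.array([1.0, -2.0]), np.array([0.1, 0.3]), np.array([0.5, -0.7]))
x_interior = np.linspace(0.05, 0.95, 19)
loss = pinn_loss(params, x_interior, lambda x: np.sin(np.pi * x))
```

Training would minimize `pinn_loss` over `params` with a gradient-based optimizer. In practice the derivatives are usually obtained by automatic differentiation rather than the hand-derived formula used here.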

activation function, neural network, training epoch, (12 more...)

2311.03746

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)